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applied to explain the statistics of stock traded volume at different aggregation scales. 
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1. Introduction 

Complex non-equilibrium phenomena have lured the attention of a large part of the 
physical community in recent years. Despite the knotty character of this type of 
systems, by applying particular techniques, it has been possible to make a physical 
characterization of their leading properties. Among these techniques, let us mention 
the "superstatistical" approach. It originated from the observation, in the context of 
nuclear collisions, that deviations from the standard (exponential) probability density 
function (PDF) could be explained through the fluctuations of an inner parameter, in 
that case, the cross section pp. Because a statistics is made up of another statistics, 
this approach became known as superstatistics, or statistics of statistics. Therafter, 
it was further generalised, endowed with a statistical mechanics interpretation [2] and 
widely applied since then. In fact, for many systems, it is realistic to consider that some 
of the characteristic parameters may be not strictly constant, but instead fluctuant, 
either in time or in space, according to a specific PDF, in a scale much larger than the 
primary stochastic process. The superstatistical approach has been quite successful in 
accounting for observations in fluid turbulence [3], physiology [4j, human activities [5], 
ecology [6], and also in finance [TJIEIE], amid many others. Let us also point out that, 
in economics and social sciences, albeit ad-hoc, statistical mixtures have been taken into 
account for some decades [TO] . 

In this work, we consider as primary process the one given by the stochastic 
differential equation (SDE) 



where W t represents a standard Wiener process, with unitary variance, and 7, 6 and 
5 are positive real parameters. This SDE, first studied by Feller [UJ, is well-known in 
mathematical finance. It was employed by Cox, Ingersoll and Ross to model short-term 
interest rates [32] and later became popular in mimicking stochastic volatility, like in 
the Heston model for price dynamics [13]. Mean-reverting square-root diffusion has 
also been considered in other contexts, such as in modelling neural spiking [TJ] or in 
problems of biological diffusion [15] . 

Still in the context of finance, in the first approximation, Eq. ([1]) describes the 
dynamics of share trading volumes, although the tails of the empirical distributions 
deviate from the steady solution associated with the SDE ([!]), 



which is the Gamma PDF [UJ, with M a normalization constant. Fluctuations that can 
explain the observed deviations within the superstatistical mixing framework [9] have 
been detected in the parameter directly related to the mean value [8]. The resulting 
PDF, known as g-Gamma, is a generalization of the Gamma distribution that can be 
cast into the form of the F-distribution and which basically turns the exponential tail 
into a power-law one. It has been shown to be in excellent agreement with empirical 
observations at different granularity timescales (from 1 min to days) [H [9l [161 E] • 




(1) 




(2) 
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The study of complex systems often encompasses the analysis of the probability 
function of the addition of stochastic observables, mostly to appraise the hypothesis of 
scale invariance. Precisely, in respect of this, the fact that the description of empirical 
volume PDFs in terms of g-Gamma distributions applies at different aggregation scales 
is particularly interesting, especially because, unlike the Gamma distribution, the q- 
Gamma is neither closed under convolution nor correlations can be fully neglected. 

To understand these observations motivates the present work. Although our initial 
motivation comes from an econophysical problem, the present results may be of interest 
for a wider scenario where linear diffusion applies, as soon as parameter fluctuations are 
ubiquitous. 

The manuscript is organized as follows. We first summarize the pertinent results 
related to the Fokker-Planck equation (FPE) associated to the stochastic process (CQ). 
We apply these results to obtain the PDFs resulting from an accumulation process. 
Thereafter, we apply the statistical composition procedure, in which the reverting mean 
is the fluctuating parameter. We obtain joint distributions that us allow to characterize 
the correlation structure as well as the aggregation properties. Finally, we apply the 
analytical results to interpret the granular features of real time-series of trading volumes. 



2. Primary process 

The forward Fokker-Planck equation (FPE) associated to Eq. (CQ), for the conditional 
probability P = P(x, t\x', t'), reads 

The propagator of this FPE has been obtained by Feller [11]. Since then, the time- 
dependent solution has been systematically overlooked. For instance, in stochastic 
volatility models like the Heston model and its variants, the price is the quantity of 
interest [131 CB], while the volatility, modeled by Eq. ([T]), is only an auxiliary quantity. 
Wherefore, it is usually integrated out by considering a distribution of the (initial) 
volatility equal to the stationary solution. Thence, if we want to go further, we must look 
for the time- dependent solutions. With that goal in mind, we summarize the procedure 
for the obtention of the propagator, first presented by Feller [11], that embodies the 
definitions and the partial results which are going to be useful in following sections. 
Laplace transforming Eq. ([3]), one gets 

dP ( $ 2 \dP . ~ 

where P = P(w,t\x' ,t'). With the initial condition P (x, t'\x', if) = 5(x — x'), whose 
Laplace transform is P (w, t'\x', if ) = exp(-wx'), the solution of Eq. (TJJ, that can be 
obtained by the method of characteristics, is 

p (w tW t') - exp( ~^ } - ex P(~^ ~ irkD (5) 
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where we have defined, 
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P{w,t\x',t') = „ sf) , (6) 



In the long-time limit At = (t — t)' » I/7 (hence, — > 0), Eq. (jSJ) becomes 

1 

;i + ~B w) 

whose inverse Laplace transform gives us the steady solution in x space, that reads 

P s {x) = C' 1 ( = / V B °\ (7) 

for nonnegative x, and zero otherwise. This is the Gamma (or Erlang) distribution, 

r^o PI- 

For any At, the conditional PDF P(x,t\x' ,t') can be obtained by first expanding 
the exponential in Eq. (jSJ) and then performing the mappings f3 — > n + (3 and B§ —> B 
in Eq. (J7J). That is, 

P{x, t\x' : t') = C~\P{w, t\x', t')) (8) 
V exp(-|)(|r f 1 x 



n>0 



V exp(-|)(|r^- 1 exp(-f) 
^ n\ B n +?T(n + f3) 



n>0 



J fi [i-e] Vl U o [i-0]J ' (12) 

where J n (x) is the nth-order modified Bessel function of first kind [20] . 

In the limit 7 At >> 1, Eq. (TT2i) tends to the stationary PDF ([Jj), which is also 
obtained by performing the integration J dx'P (x, t\x', t') P s (x 1 ). Still in the steady 
state, the two-time joint PDF is 

P(x, t; x', t 1 ) = P{x, t\x\ t')P s {x') (13) 



[xx') 2 exp(-^Q I ) T ( 2Vx^e 



r(/?)^ +1 [i - ©]©^ Vl [b [i - e] ) ' (14) 

which in the long-term limit (0 — > 0), of course, becomes the product of the stationary 
PDFs, P s (x)P s {x'). 
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2.1. Aggregation 

Once known the propagator P(x, t\x', t'), given by Eq. ( fT2l) . and assuming stationarity 
and Markovianity, we can determine the iV-time joint PDF 

P(xi, ...;x N ) = P(xt, tx,...;x N , t N ) (15) 

JV-l 

= ] [ P (x i+1 ,t i+1 \xi,ti) P s (xx). (16) 

i=l 

In Eq. (1151) . we can consider elements generated by Eq. ([I]) that are equally spaced in 
time, such that U = (i — 1)A£, for 1 < % < N. Then, we can evaluate the resulting 

N 

stationary PDF of X = ^2 Xi, 

i=i 

Pn (X) = J... J dx\ . . . dxN${X — Xj)P (x±; . . . ; xn) ■ (17) 

For arbitrary N > 2, when At >> 1/7 (6 -> 0), P N (X) must tend to the iV-fold 
convolution of the Gamma distribution (00). Since it is closed under convolution, we 
have, 



X^- 1 exp(-f) 



lim P N (X) 



Bq P T(N(3) 

In the opposite limit At<<l/7(6^1 

1 X X /3_1 exp( 



x 



lim P N (X) = —PiC—) = —, ^f" . (19) 

That is, in both limits, the sum of adjacent variables is Gamma distributed, 
although with different values of the parameters. For intermediate degrees of correlation, 
following the behavior of the Bessel factor for small and large values (power-law and 
exponential, respectively) [21] , the PDF of the sum is expected to remain close in shape 
to the Gamma distribution, i.e., growing as a power-law at the origin and decaying 
asymptotically with an exponential tail. The extreme cases suggest that as the number 
./V of added variables accrues the power-law exponent increases as well: the less, the more 
correlated the aggregated variables are. Concomitantly, the exponential tail decays more 
slowly: the slower, the larger the correlations. 

Let us analyze more carefully the particular case N = 2. Probability 
distribution ( flT|) is explicitly, 

P (x) _ ^ x "- 1 ' 2 "P(-W^>w(sf%) 

r(/3)^ +1/2 [l-6] 1 /2( 2v ^)/3-i/2 

Taking into account the asymptotic behavior of the Bessel function, we can make 
the following observations. For O — > 0, i.e., approaching independence, P2{X) ~ 
X /3-i/2 exp (__|L_) X /3-i/2 = x 2/3 " 1 exp(- Bo[ f_ e] ). For small enough X the 

distribution goes to zero as P2{X) ~ X 2 @~ l and exponentially decays with a 
characteristic constant value equal to B [l — 0]. This behavior changes when 
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Figure 1. PDF of the aggregation of two consecutive values, given by Eq. (I20p . 
for (/3, Bo) = (2,0.5) and different values of (hence At) indicated in the figure. 
Dotted lines correspond to fittings with the Gamma distribution (for qualitative 
assessment), that exactly coincides in the extreme cases = 0, 1. Insets show the 
PDFs in logarithmic scales to better display the tails. Dashed lines are drawn for 
comparison and correspond to X 13 ' 1 and X 2/3_1 in the left panel and to exp(— X/ Bq) 
and exp(— X/(2Bq)) in the right-hand side one, which follow the asymptotic behaviors 
of the extreme cases = and 1, respectively, while intermediate cases are ruled by 
exp(-X/[(Ve + l)S ]). 



we approach the full dependence case the functional form of which is P2{X) ~ 
^- 1 /2 e xp(-^ I )exp(^ft I )X- 1 /2 = x^expi-j^—). Therefore, the limit 

— > 1 yields for small X, P2(X) ~ X^~ x which despite being a power-law has got a 
different exponent. The large X decay keeps its exponential form but with a different 
constant of decay equal to B [l + \/Q]. The behavior of Eq. (120]) between the two 
limiting cases is exhibited in Fig. [TJ 

It is noteworthy that for arbitrary At, distribution (1201) approaches the Gamma 
distribution (which is the exact distribution in the extreme cases) at least for a few 
decades. On the one hand, the behavior near the origin is the same as for At — > oo. On 
the other hand, an effective parameter for the exponential decay can be found to adjust 
the tails since the true asymptotic behavior is attained after many decades only. 



2.2. Correlations and moments 

The joint probability density given by Eq. (TH1) allows evaluation of moments and two- 
time auto-correlation functions. To that end, it is useful to calculate 

Cfim — % ) = J J dxdx P (yX\ x )x X , 

which is explicitly, 

~ r(g + m) r(g + n)Br n v 
r(/3) 2 

[l-e] f,+m *" 2 F 1 {p + m,l3 + n,p,e). (21) 
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In particular, for any 7 At, the (centered) linear auto-correlation is (see also |18j ) 

Cu = c 11 -c 10 c 01 = peB 2 . (22) 

From Eq. (ED), higher order correlations behave as 

C nm x fW (1 - — 6 j , (23) 

for large 7 At, at first order in O. Meanwhile, for small 7 At, at first order in 1 — G, one 
has 

~ r(g+m + rQ^ / mn(l-e) \ 

nm ~ Y((3) 2 \ /3 + m + n-lJ- [ ' 

Hence, correlations of any order decay like a single exponential function, with 
characteristic time I/7. 

However, a different behavior occurs if 7 fluctuates, within the framework that 
we will be considered in the following section. Particularly, a power-law decay 
of the correlation function is obtained if one assumes that the PDF of 7 decays 
exponentially [22]. In such a scenario it should be stressed that our stationary 
distributions remain the same because they do not depend on parameter 7. 

From Eq. (1211) . the statistical raw moments can be directly obtained as 

(x n ) = C n0 = V -^^Bl (25) 
Hence the first centered moments are 



<(* - (x)f) = (3Bl 
((x - (x)f) = 2(3B\. 

Concerning aggregation, for N = 2, from Eq. (!20l) and using the properties in 
Ref. [20], statistical moments (X n ) are given by 

(X n ) = r( rf 2 ^ ) ^ [1 " ^ 2W + ^(3+\(n + l)^+ 1 - 1 6).(26) 
The first raw moments read 
(X) =2/350, 

(X 2 ) = 2(3(2(3 + 1 + Q)B 2 , 

(X 3 ) =4/3(/3+l)(2/3 + l + 30)5 o \ 
From where the first centered moments are 

((X-(X)) 2 ) = 2(3(1 + 6)5 2 , (27) 

((X-(X}) 3 }= 4(3(1 + 30)B O 3 . (28) 

Notice the increase in the moments with due to the longer exponential tails. 

For arbitrary N, in the limit of vanishing O, the raw moments can be also obtained 
from the moment generating function M(z) = (1 — Bqz)~ N/3 , for any N > 1. 
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3. Statistical mixing 

Let us now reckon that instead of constant, B evolves stochastically, independently of x, 
along the times eries of the primary process, for which the time lag between successive 
points is At. In agreement with previous observations [8], 77 = Bq 1 can be assumed 
Gamma distributed with parameters (a, Moreover, we consider the update scale 

of Bq (within which the parameter remains basically constant) much larger than the 
timescale I/7, so that the average over different samples is obtained by performing the 
mixture, 

P (M) (...) = J dr)P{...\ri)P{ri). (29) 

Particularly, we can determine the mixed joint distribution P^ M ^(x,t; x',t'), which, 
after integration, reads 

p(M )(x ,, = T(a + 2P)K«(xxr- 1 [l-Qr 

where k = k [1 — 0]. Subsequent integration over x', allows us to obtain the global 
distribution of x (locally stationary) in the mixing process, 

p(M )(r) r(« + /3) k^- 1 

{) r(a)T(f3)(K + x)«+P- { ' 

This PDF is a generalization of the Gamma distribution (recovered in the limit 
a — > 00, while kq/oi is kept finite), known as g-Gamma. After suitable rescaling, it can 
also be cast into the form of an F-distribution for which non-integer degrees of freedom 
are permitted. 

It is worth noticing that, even in the limit At >> I/7 (independence), 

lim *<) = r '° + 2 fl " S( ^ r :' , (32) 

is different from the product P^ M \x)P^ M \x'). Nonetheless, this does not necessarily 
correspond to a case of correlated variables x and x'. Bearing in mind the multi-variate 
Student-t distribution [23], we can understand our Eq. fl32l as a bi-variate F-distribution 
of uncorrelated variables. 

3.1. Aggregation of mixed variables 

Performing the mixing operation given by Eq. (1291) on Eq. (|20|) . after integration, we 
obtain 

2 1 } T{a)T{2(3) {k + X)«+W 

2 + 2 +A 2 +ft (^TxF)' (33) 
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which, in the independence limit At >> I/7 (but still smaller than any characteristic 
time of the mixing process), evolves into 

lim p (M) m _ T(a + 2(3) /eg* 2 *- 1 

lirn^P, (X) - T{a)rm {Ko + x)a+w ■ (34) 
This result can be easily generalized to N terms by iterated convolution operations: 

aScA {X) ~ T(a)T(N(3) (ko + XF+W (35) 
On the other hand, in the opposite limit At << I/7 

lim P { N M) (X) = IpW A 
At->o N v 1 N K N J 

Tja + P) Kg(f)^ 1 

NT(a)T((3)(K + §)^- { ] 

Notice that in both limiting cases a g-Gamma arises, although with different exponents. 
This is a consequence of the mixing of the Gamma distributions that rules the respective 
extreme cases. For intermediate instances, the (effective) power-law exponent at the 
origin follows the same scaling relation as the unmixed Gamma distribution. Meanwhile, 
the exponent of the tail is insensitive to both At and N, conserving its value a + 1, 
which only depends on the degree of inhomogeneities (given by a) , as it was empirically 
noticed in the application given in Ref. [8]. 

These behaviors are illustrated in Fig. [2] for numerical implementation of Eq. (TjQ) 
with mixing. From panels (d) to (f) we verify that P 2 M (X) departs from Eq. (1361) 
and approaches Eq. (1351) . as At swells and added variables become independent. As 
aggregation proceeds, i.e., N increases, the distribution shrinks below the maximum, 
departing from the fully dependent case towards the independent one. This is because 
the exponent of the power law at the origin follows the behavior of the independent case 
ruled by the N/3 exponent (a reflection of the behavior at the origin of the unmixed case 
as exemplified in Fig. [T]), while as X approaches the maximum, there is a crossover 
towards the independence At — > limit. The figure also turns out evident that, 
contrarily, the tail exponent does not depend on N nor on At. Its changeless value 
a + 1 indicates that the manifestation of inhomogeneities stays invariant at the different 
accumulation scales. 

Analogously to the Gamma approximation for the aggregation of variables, 
discussed at the end of Section 12.11 the g-Gamma distribution plays the same role 
after mixing. As a consequence, even when not the exact solution, the g-Gamma suits 
the PDF generated by the mixed Feller diffusion. 



3.2. Correlations and moments after mixing 



As the term statistics of statistics suggests, the global statistical properties correspond 
to the averaging of the locally stationary statistical properties over the fluctuations. 
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Figure 2. PDFs of the mixing variable x (a) and agreggated variables X for different 
values of N and At indicated on the figure (b)-(f). The (gray) lines join the points of the 
histograms built from numerical implementations of Eq. ([1]) with mixing. Numerical 
integration of the stochastic differential equation was performed by means of an Eulcr 
algorithm, with time step 10 -3 , and update of 1/Bo, drawn from r Q l / K[) , was done at 
each St = 1007 -1 . Parameters values are j = 1, a = 4, p = 3, Kq = 1/3. In panel (a) 
the full line represents the global PDF given by Eq. (f3"Tj) . Panels (b)-(f) exhibit the 
global PDF of the addition of N = 2 (d-f), 4 (b) and 8 (c) consecutive variables, at 
each At, for the same process. For N = 2, the full line represents Eq. (f3"31 the dotted 
line the limit At — * Eq. (|31)|) and the dashed line the independence limit given by 
Eq. ([33]) . All the plots are in the same log-log scale for comparison. 

Therefore, central two-time correlation functions of x are 

C£J = j J dxdx'driP(x,x')P(ri)(x- (x)) n (x' - (x)) m (37) 

= J dT]P{r])C nm . (38) 
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Assuming that P{rf) follows a T(a, 1/kq), from Eq. (I2"2"j) . we have 

^ = <V,Ve = (tt _y_ 2) , (39) 

which displays the same exponential decay as the unmixed process. 

It is important to introduce two remarks. First, correlation functions and moments 
must not be computed for time differences At greater than the characteristic time in 
which the fluctuating parameter can be considered constant. Second, although we can 
feel enticed to compute the overall (mixing) statistical properties by integrating the 
variables using the pM(. . .) weights, e.g., Cjf } = (xx') (M) - (x) ^ M) {x')^ M \ this is 
wrong, since one must keep in mind the non-stationary nature of the stochastic process 
(due to the fluctuations in rj). Consequently, the statistical properties must be first 
computed locally and only afterwards the parameter fluctuations taken into account. 
Otherwise, spurious results may come forth such as for instance centered correlations 
tending to a constant value different from zero in the long-time limit. For instance, one 
has 

Cif = (**')<"> = (40) 
{a — ±){a — 2) 

which for G = is equal to J (x) (x')P(r)) dr), as it should be according to the discussion 
above. 

In particular, moments after mixing must be also computed by averaging over the 
statistics of rj, i.e., f(...)P(r])dr), the expressions (I25l) - (l28l) obtained in the previous 
section for the locally stationary process. 

For N = 2 aggregated variables, by integrating Eq. (j26j) over -q with a T(a, 1/aco) 
weight (which leads to (Bq) = n^T{a — n)/Y(a) for n < a), one obtains 

{X n )m = r(2/? + n)r(a-n) _ p+n 

n 1 1 
2 J- 1 (/3 + -,/3+-(n + l),/3 + -,e). 

Notice that only moments with order n < a are defined. The first raw moments (that 
can also be obtained by performing the mixing directly on Eqs. ( 1271) ) are 

{X) (M) = (41) 
(a-1) 

(a — l)(a — 2) 

(M) = 4, 8 (, 8+ l)(2, 8+ l + 39) K g 
V 1 (a-l)(a-2)(a-3) V ; 



Finally, for the centered moments, by integration of Eqs. ff28|) over the fluctuations 
of rj, one finds 



({X- (x» 2 >( A/ ) 



2^(1 + 0)*: 



{a - l)(a-2) 



Statistical mixing and aggregation in Feller diffusion 



12 



<(*- W) 3 ) (A/) 



4/3(1 + 30)4 



(a-l)(a-2)(a-3)' 



Notice that, in the limit of a, kq oo with hq/ a = Bq constant (that is when the 
Gamma distribution of the fluctuating parameter tends to a Dirac 5 centered at 1/Bq), 
one recovers the unmixed moments and correlations. 

The case — > 0, i.e., the addition of independently F-distributed variables (N = 2) 
has been studied before from a pure statistics perspective [2U [251 126] • In these cases the 
resulting distribution was approximated to the F-distribution by imposing statistical 
moments matching. This procedure could be extended to arbitrary by means of the 
above expressions for the lowest order moments. 

4. Application to traded volume in financial markets 

Although the largest part of the work made on financial markets is devoted to the 
(log-)price fluctuations and the volatility, it is recognized the essential role of the 
traded volume for a trustworthy characterization of a financial market global dynamics 
portrait [27]. As a matter of fact, the price evolves in time when a certain quantity of 
equities is negotiated. 

So far as we are aware, the first studies on high-frequency traded volume were 
presented in [28] wherein asymptotic power-law decay of both the PDF and the auto- 
correlation have been held. Shortly after, another study on the traded volume PDF 
was presented [16], but at that time, the entire span of the traded volume values 
was taken into account and the PDFs under analysis were very well adjusted by q- 
Gamma distributions. This observation holds both for consolidated highly liquid stock 
markets [9} [16], [17] (NYSE and NASDAQ) and for emerging ones like the Brazilian 
[H] and the Chinese [29] . This fact indicates universality of the functional form of the 
distribution function, at least approximately, and therefore of the underlying dynamical 
mechanism independently of the size of the market. 

We shall now investigate the applicability of the dynamical scenario presented in 
the preceding sections to model stock traded volumes. We have analyzed two different 
paradigmatic examples: i) the total volumes traded in the emerging Brazilian stock 
market BOVESPA (a total of 9970 observations, recorded at intervals of 30 minutes, 
spanning the period from 3 rd January 2005 to 13 th September 2007) and ii) the 1 minute 
records of Pfizer (PFE) traded volume at New York Stock Exchange between the 1 st 
July 2004 and 31 s * December 2004 in a total of 49585 registered values. During the 
respective period each market can be considered in a regular state, i.e., neither a crash 
nor other extreme behavior was earmarked. Notwithstanding, future work should shed 
light on the traded volume dynamics and its connection with the theory that predicts 
log-oscillatory behavior for the price [301 EI] in the advent of a crash. It should be noted 
that crashes are empirically associated with herding phenomena between agents and 
consequently with huge traded volumes [32]. Accordingly, the theory of log-oscillations 
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must reflect the emergence of a new dynamics enhancing large values of 9 and vice- 
versa. Our assumption is supported by prior empirical financial studies of daily time 
series which found an increase in trading volume over the previous six months to a price 
plunge [33] . 

We start by determining the values of the set of parameters a, (3, kq, by adjusting 
the traded volume empirical distribution at the lowest time resolution of the data, that 
will be considered the unit timescale, in each case. 

For the Brazilian market, whose lowest scale is 30 min, the empirical distribution 
of 30 min stock volumes (hence, At = 1) is depicted in Fig. [3](a). From the non- 
linear regression procedure (minimization of x 2 error leading to the optimization of the 
parameters correlation matrix)we obtained (a, (3, k,q) = (7.11 ± 0.32, 3.90 ± 0.04, 1.45 ± 
0.01) [the respective Kolmogorov-Smirnoff distance, Dks, to the empirical probability 
distribution is equal to 0.016, x 2 — 0.0380, R 2 = 0.994]. Since the very small volume 
regime (x < 0.1) is ruled by a different mechanism [8], it was not considered in fitting 
procedures. 

In order to model the empirical distribution of the 1-hour traded volume, through 




Figure 3. (a) Empirical PDF of the BOVESPA 30-minute traded volume x (circles) 
and best non-linear regression result for Eq. (|3ip (full line), yielding (a,/3, kq) = 
(7.11,3.90,1.45). (b) Empirical PDF of the BOVESPA 1-hour traded volume, 
A,(cicles). The full line corresponds to Eq. ([33]). P 2 ' M ' (X = x + x'), with (a, f3, k ) as 
above and 7 = 1.5. The dotted line represents a g-Gamma distribution with parameters 
(a,P,K ) = (7.97,6.97,1.82) (Dks = 0.014) obtained by a non-linear regression 
procedure. For comparison, the (gray) short and long dashed lines correspond to the 
limits of full dependence and independence, given by Eqs. l[36|) and ([34]) ). respectively. 
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Eq. (1331) . we used the values of the parameters resulting from the numerical adjustment 
of the 30-minute traded volume PDF, together with the value of 7 obtained from the 
adjustment of the linear auto-correlation function with an exponential decay. Although 
the exponential does not describe well the correlations in long-term regimes, it can be 
considered as a good approximation for the short timescales of interest. From which we 
appraised 7 = 1.5 ± 0.03, since the characteristic exponential time decay was r = 20 
min and we adopted 30 min as the time unit. 

Once obtained the values of parameters (a, ft, Ko, 7), we compared the 1-hour traded 
volume empirical PDF (symbols) with Eq. (|33|) (full curve), as shown in Fig. [3](b). We 
observe a fair agreement between them (Dks — 0.037), specially recalling that very 
small volumes should not be considered. However, notice that a simple non-linear q- 
Gamma adjustment (dotted line) provides also a very good description, with parameters 
(a, ft, k ) = (7.97±0.35, 6.97±0.06, 1.82±0.01) [D KS = 0.014, X 2 = 0.0224, R 2 = 0.997] 
(dotted curve in Fig. 13(b)). This illustrates once more how the g-Gamma model, 
although approximate, appears to hold at different aggregation scales. Furthermore, 
the fitting we present fails to reject the null hypothesis for the Pearson's statistical test 
with P = 0.05. Notice also that although correlations are not completely negligible at 
these timescales, they do not play an important role in the resulting PDF, which is very 
close to the one that would be obtained by assuming independence (gray long-dashed 
line) . 

It is remarkable that, volumes for a company in a developed market, recorded at 
high frequency, as PFE, display the same qualitative features, despite correlations are 
stronger at those (high frequency) time-scales (not shown, however, as illustration see 
Ref. [H3, [17]). In particular, from g-Gamma fitting, an almost constant value of a is 
observed at the different scales. Moreover, PFE (in 2004), as well as the top 10 NASDAQ 
and NYSE stock volumes (in 2001) respectively display a ~ 4, 4 and 3, while a ~ 7-8 for 
the Brazilian market, indicating a lower degree of inhomogeneities in the former case. 
Then a constitutes an index to detect and quantify the level of inhomogeneities of a 
market or period. 

5. Concluding remarks 

From the time-dependent PDF of the SDE describing mean-reverting square-root 
diffusion, we derived the two-point (two-time) joint PDF in the steady state, as well 
as the PDF of the addition of variables generated according to this dynamical process. 
We further considered a scenario in which the mean reverting term presents fluctuations 
that can be introduced twofold: they correspond to variations of the parameter either 
over runs or within the same run in a time scale much larger than the scale needed for 
the system to reach stationarity. Although our survey was mainly inspired by previous 
empirical findings of inhomogeneities in the traded volume flow corroborating a Gamma- 
Gamma phenomenological proposal, we uphold that a similar approach might be applied 
on the study of systems exhibiting inhomogeneous occurrence of Poisson events [35] or 
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simply on problems for which the g-Gamma distribution has shown to be statistically 
relevant like in granular media [36]. We have also discussed paradoxical results related 
to the non-commutativity of averaging and mixing operations. 

In both limits of full independence (G — > 0) and full dependence (0 — > 1), the PDF 
of the sum of consecutive variables is a g-Gamma distribution. We have shown that in 
intermediate situations, for arbitrary degree of correlations, the upshot of aggregation 
at different scales is also well-described by that distribution, although not the exact 
one. Moreover, while the increase at the origin is ruled by the independence behavior, 
the tail is governed by the degree of inhomogeneities which manifest at any aggregation 
scale. Then the g-Gamma form is approximately preserved. 

We would like to stress that the model we have introduced explains why i) even 
for 6^0, the exponent (3 of the associated PDF increases by increasing the number of 
added variables as well; ii) the tail exponent, which may be considered and indicator 
and quantifier of the presence of inhomogeneities, is preserved at different scales. 

We have shown that all this scenario applies in the specific case of traded volume 
within periods of regular trading behavior (i.e., absence of bubbles/crashes) which 
represent the majority of the trading history. Nonetheless, for the fallout of stock market 
crashes, a further survey based on appropriate data that bridges the trading volume 
dynamics with the occurrence of extreme episodes (namely allowing for the theory of 
log-oscillations [3]]) is of manifest interest for an all-inclusive comprehension of financial 
markets dynamics. Especially, it would be interesting to understand the relation between 
the theory of log-oscillations and possible modifications in the description of the degree 
of inhomogeneities in a market (fluctuations in 9) that are depicted by the parameter a 
during most of the trading records. 
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